Modelling pronominal anaphora in statistical machine translation
Abstract
Current Statistical Machine Translation (SMT) systems translate texts sentence by sentence without considering any cross-sentential context. Assuming independence between sentences makes it difficult to make certain translation decisions when the necessary information cannot be determined locally. We argue that SMT needs to include cross-sentence dependencies. As a case in point, we study the problem of pronominal anaphora translation by manually evaluating German-English SMT output. We then present a word dependency model for SMT, which can represent links between word pairs in the same or in different sentences. We use this model to integrate the output of a coreference resolution system into English-German SMT with a view to improving the translation of anaphoric pronouns.
Similar resources
Application of Pronominal Divergence and Anaphora Resolution in English-Hindi Machine Translation
So far, the majority of Machine Translation (MT) research has focused on translation at the level of individual sentences. For sentence-level translation, MT research has addressed various divergence issues for a large variety of languages; the issue of pronominal divergence has been presented only recently. Since the quality of translation as required by users follows coherent multi-sente...
Pronominal Anaphora in Machine Translation
State-of-the-art machine translation systems rely on strong independence assumptions. Following these assumptions, language is split into small segments such as sentences and phrases, which are translated independently. Natural language, however, is not independent: many concepts depend on context. One such case is reference introduced by pronominal anaphora. In pronominal anaphora a pronoun word ...
Proposal of an English-Spanish Interlingual Mechanism Focused on Pronominal Anaphora Resolution and Generation in Machine Translation Systems
In this paper, an interlingual mechanism oriented to the resolution and generation of pronominal references in Machine Translation (MT) systems is proposed. This mechanism is based on the Slot Structure (SS) presented in [3] [2]. Pronominal reference resolution in English and in Spanish is compared in order to study the existing discrepancies between the two languages. From this s...
Exploring Semantic Information from Hindi Dependency Treebank for Resolving Pronominal Anaphora
Anaphora resolution is an exigent task in almost all NLP applications, such as text summarization, machine translation, information extraction, and question-answering systems. A lot of work has been done on identifying, and still more needs to be done on finding, the factors responsible for resolving anaphora in all languages. An attempt has been made to resolve Hindi pronominal anaphora using...
Coreference-Oriented Interlingual Slot Structure And Machine Translation
One of the main problems of many commercial and experimental Machine Translation (MT) systems is that they do not generate pronominal anaphora correctly. As mentioned in Mitkov (1996), solving the anaphora and extracting the antecedent are key issues in a correct translation. In this paper, we propose an interlingual mechanism that we have called Interlingual Slot Structure (ISS) ba...